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ABSTRACT 

Spliceosomes remove introns from primary gene 
transcripts. They assemble de novo on each intron 
through a series of steps that involve the incorpor- 
ation of five snRNP particles and multiple 
non-snRNP proteins. In mammals, all the intermedi- 
ate complexes have been characterized on one 
transcript (MINX), with the exception of the very 
first, complex E. We have purified this complex by 
two independent procedures using antibodies to 
either U1-A or PRPF40A proteins, which are known 
to associate at an early stage of assembly. We dem- 
onstrate that the purified complexes are functional 
in splicing using commitment assays. These 
complexes contain components expected to be in 
the E complex and a number of previously unrecog- 
nized factors, including survival of motor neurons 
(SMN) and proteins of the SMN-associated 
complex. Depletion of the SMN complex proteins 
from nuclear extracts inhibits formation of the E 
complex and causes non-productive complexes to 
accumulate. This suggests that the SMN complex 
stabilizes the association of U1 and U2 snRNPs 
with pre-mRNA. In addition, the antibody to 
PRPF40A precipitated U2 snRNPs from nuclear 
extracts, indicating that PRPF40A associates with 
U2 snRNPs. 

INTRODUCTION 

In eukaryotes, the majority of primary gene transcripts 
(pre-mRNA) undergo splicing, a process that removes 
introns and joins exons to produce messenger RNA 
(mRNA). Splicing is catalysed by the spliceosomes, 
which contain five small ribonucleoprotein particles (Ul, 



U2, U4, U5, and U6 snRNPs) and many non-snRNP 
proteins (1,2). The assembly of spliceosomes has been 
studied in most detail on transcripts containing a 
minimal functional unit (exon-intron-exon). 
Spliceosomes assemble in a series of consecutive steps 
that produce complexes E, A, B and C. First, in the E 
complex, the 5'- and S'-splice sites (SS) of an intron are 
recognized by the specific binding of the Ul snRNP and 
the proteins U2 auxiliary factor (U2AF), respectively. 
Significantly, the pre-mRNA substrate is committed to 
the splicing pathway and the splice sites are in close prox- 
imity (3,4). The complex contains the U2 snRNP particle 
as a component, which is essential for its formation (5-7). 
At this stage, association of the U2 snRNP with the 
complex is weak but the underlying mechanism is not cur- 
rently understood in detail. The next complex to form is 
complex A, which requires ATP. In this complex, the U2 
snRNP is bound stably by base pairing to the branchpoint 
sequence, and U2-associated proteins of the SF3A/B 
complexes are bound to the anchoring site upstream of 
the branchpoint (8). This conformation serves as a 
binding platform for the U4/U6.U5 tri-snRNP, which cul- 
minates in the formation of complex B. The fully 
assembled spliceosome contains all five snRNPs and 
becomes competent for splicing through a series of re- 
arrangements. These rearrangements result in the dissoci- 
ation of Ul and U4 snRNPs and the formation of the 
catalytic centre for the first transesterification reaction, 
in which the 5'-exon is displaced and the lariat intron is 
formed. This produces complex C. The second transester- 
ification reaction results in intron removal and the joining 
of exons (1,9). 

The components of complexes A, B and C have been 
characterized in greatest detail on a transcript named 
MINX, which is derived from adenovirus sequences 
(10-19). However, the first complex in this series, E, has 
not been purified and characterized. The only E complexes 
characterized in any detail were assembled on substrates 
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containing a neuron-specific exon, the Nl exon of c-src 
pre-mRNA (20,21). The protein composition of 
complexes formed on these transcripts provided important 
insights into the mechanism by which the exon is re- 
pressed, but it is not clear whether these complexes fit 
into the constitutive assembly pathway defined by 
MINX. For example, it is not clear whether the process 
of assembling complex A involves the same steps for c-src 
in WERI extracts as for MINX in HeLa. The progression 
from E complex to A complex can be understood only by 
determining the composition of both complexes on a 
common pre-mRNA. For this reason, we have purified 
and characterized complex E formed on MINX RNA in 
HeLa nuclear extracts. 

The E complex we purified contains some factors in 
common with the A complex. In addition, we identified 
novel components that are specific for the E complex. 
These include the proteins of the survival of motor 
neurons (SMN) complex. Our data suggest that the 
SMN complex proteins are required for stabilizing the 
interactions between Ul and U2 snRNP with 
pre-mRNA in the E complex. Moreover, using a 
PRPF40A-specific antibody, we purified U2 snRNP 
complexes that contained PRPF40A and the SWI/SNF 
chromatin remodelling complex proteins. The E complex 
appears to be assembled from three principal 
sub-complexes: the Ul snRNP, the U2 snRNP and the 
SMN-associated complex. 

MATERIALS AND METHODS 

Antibodies 

Rabbit polyclonal antibodies were raised (by Eurogentec) 
against peptides of U1A (amino acid 1-14), PRPF40A 
(amino acid 380-394), SF3A2 (amino acid 444-458) and 
purified using a SulfoLink column (Pierce) containing 
the antigenic peptides according to the manufacture 
procedure. The antibodies to SMN1 and GEMIN3 
were purchased from BD Bioscience (manufacturer 
number 610647 and 612152, respectively). The antibodies 
to SIP1 (MANSIP1A), GEMIN5 (GEM5M), GEMIN6 
(GEM6B) and GEMIN7 (GEM7B) were kindly 
provided by the MDA Monoclonal Antibody Resource 
(22). The antibodies to ACTL6A [BAF53 (N-19): 
sc-47808] and DDX15 [DDX15 (T-20): sc-67550 and 
(C-16): sc-67547] were purchased from Santa Cruz 
Biotechnology. 

In vitro transcription and splicing 

MINX pre-mRNA was synthesized from a PCR product 
template of pMINX plasmid (23) using MEGAshortscript 
kit (Ambion). [ 32 P]-labelled MINX pre-mRNA was 
synthesized from the same template as described before 
(specific activity 315 000 cpm/pm) (24) and mixed with 
unlabelled MINX for easier monitoring of complexes. 
HeLa nuclear extract was prepared according to Dignam 
et al. (25). Splicing reactions were assembled either with 
the isolated E complex or an equivalent amount of naked 
pre-mRNA in the presence of 33% nuclear extract, treated 
or untreated with oligonucleotides/antibodies, 2.5 mM 



MgCl 2 , 2mM ATP and lOmM creatine phosphate. 
Reactions were incubated at 30°C for the times indicated. 
RNA was extracted and analysed by 14% PAGE followed 
by radiography. 

Purification of the E complex with the antibody to Uf A 

To assemble E complex, nuclear extract was first dialysed 
against 20 mM HEPES (pH 7.9), 20% glycerol (26). A mix 
(2.4 ml) containing 40% nuclear extract and 65 mM KC1 
was depleted of endogenous ATP by incubating at 30°C 
for 20min and then 15nM MINX pre-mRNA was added. 
After assembly at 30°C for lOmin, the reaction was loaded 
on six 4 ml 10-50% glycerol gradients containing 20 mM 
HEPES (pH 7.9), 65 mM KC1 and centrifuged in a Sorvall 
TH660 rotor at 29 000rpm for 14 h. Gradients were 
fractionated manually into 175 ul aliquots and counted 
by Cherenkov. The major pre-mRNA peak, fractions 
14-19, were combined and dialysed against 21 of IP150 
(20 mM HEPES, (pH 7.9), 150mM NaCl, 0.05% NP-40, 
0.5 mM DTT) for 3h. For IPs, 400 ul of Protein A 
Sepharose (PAS, GE Healthcare) were charged with 
150ug of an antibody to U1A and pre-blocked with 
0.3mg/ml BSA and 50 ug/ml yeast tRNA. To each 30 ul 
aliquot of beads 1.5 ml of dialysed gradient fractions were 
added and incubated with rotation at 4°C for 3h. 
Following six washes with IP 150, complexes were eluted 
into 800 ul of the same buffer containing 0.6mg/ml anti- 
genic peptide. For analysis of complexes, the eluted 
material was loaded on two 4 ml 10-50% glycerol gradi- 
ents containing 20 mM HEPES (pH 7.9), 65 mM KC1 and 
centrifuged in a Sorvall TH660 rotor at 26 300rpm 
for 13 h. 

RNA was extracted from each gradient fraction with 
phenol/chloroform/isoamylalcohol (PCA, 25:24:1) and 
precipitated with ethanol. RNA was separated by 10% 
denaturing PAGE and visualized by silver staining. For 
identification of the protein composition by MS, corres- 
ponding gradient fractions were combined, diluted 
two-fold with 20 mM HEPES (pH 7.9), 150mM NaCl 
and complexes were pelleted in a Sorvall TH660 rotor at 
60 000 rpm for 6 h. The pellet was re-suspended in the re- 
maining 60 ul of solution and the proteins were reduced in 
the presence of 1% SDS and 100 mM DTT at 37°C for 
45 min. 

For western blotting, eluted complexes were loaded on 
one 4 ml 10-30% glycerol gradient and 90 ul of each 
fraction were separated by 10% SDS-PAGE, transferred 
to a nitrocellulose membrane and probed with antibodies. 

Purification of the E complex with the antibody to 
PRPF40A 

Four hundred microlitres of the pre-blocked PAS were 
charged with 170 ug of affinity purified antibodies. IP 
was carried out from the E complex-assembly reaction 
(2.4 ml) diluted with 1ml of IP 150 buffer as described 
before. Complexes were eluted in 1.6 ml of IP150 contain- 
ing 0.6mg/ml antigenic peptide. The eluate was loaded on 
four 4 ml 10-50% glycerol gradients containing 20 mM 
HEPES (pH 7.9), 65 mM KC1 and centrifuged in a 
Sorvall TH660 rotor at 27 400 rpm for 12h. Gradient 
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fractions were measured by Cherenkov counting and 
RNA was analysed by denaturing PAGE. For western 
blotting, 90 ul of each fraction were loaded directly on 
10% SDS-PAGE and processed as described above. 

Mass spectrometry of the E complex 

Of each complex, 1 .26 pmoles isolated with antibodies to 
PRPF40A and U1A were separated by 8/14% SDS- 
PAGE and stained with colloidal Coomassie G250. Each 
gel lane was cut into 54 slices and an in-gel trypsin digest 
was carried out. LC-MS/MS was performed upon each 
sample using a 4000 Q-Trap mass spectrometer (Applied 
Biosystems, Warrington, UK). Peptides resulting from 
in-gel digestion were loaded at high flow rate onto a 
reverse-phase trapping column (0.3 mm i.d. x 1 mm), con- 
taining 5 um CI 8 300 A Acclaim PepMap media (Dionex, 
UK) and eluted through a reverse-phase capillary column 
(75 urn i.d. x 150 mm) containing Waters Symmetry 
CI 8 100 A media (Waters, UK) that was self-packed 
using a high pressure packing device (Proxeon 
Biosystems, Odense, Denmark). The output from the 
column was sprayed directly into the nanospray ion 
source of the 4000 Q-Trap mass spectrometer. Fragment 
ion spectra generated by LC-MS/MS were searched using 
the MASCOT (27) search tool against the National Center 
for Biotechnology Information non-redundant protein 
database using appropriate parameters: peptide mass tol- 
erance = 0.4 Da; fragment mass tolerance = 0.6 Da; the 
enzyme was selected as trypsin with one missed cleavage 
site considered; carbamidomethyl (C) was selected as a 
fixed modification and oxidation (M) as a variable modi- 
fication. The criteria for protein identification were based 
on the manufacturer's definitions (Matrix Science Ltd.). 
Basically candidate peptides with probability-based 
Mowse scores exceeding threshold (P < 0.05), and thus 
indicating a significant or extensive homology were 
referred to as 'hits'. The highest scoring fragment ion 
spectrum from each protein hit was manually inspected 
to ensure its validity and thus minimize potential false 
positives in the data set. 

Depletion of HeLa nuclear extracts 

To deplete Ul and U2 snRNPs, 100 ul of nuclear extract 
in 20 mM HEPES (pH7.9), 65 mM KC1, 2.2 mM MgCl 2 , 
0.2mM EDTA, 0.5mM PMSF, 0.1 mM DTT, 20% 
glycerol were incubated in the presence of 1 ug each oligo- 
nucleotide against Ul and U2 snRNAs (U2-E15 and 
Ul-5'C) and 2U of RNsae H (Promega) for 1.5 h as 
described previously (28). To analyse digestion of the 
targeted snRNP particles, the RNA was extracted from 
10 ul aliquots and analysed by 10% PAGE followed by 
silver staining. For SMN depletion, 200 ul of PAS were 
charged with 40 ug of an antibody to SMN and 
pre-blocked with 0.3mg/ml BSA and 50 ug/ml yeast 
tRNA. Two hundred microlitres of Hela nuclear extracts 
at either 200 or 700 mM NaCl were treated twice with 
PAS-SMN antibody-conjugate or PAS only as a 
control. Ten microlitres of extract were used for western 
blot analysis. The extracts were dialysed against buffer 
containing 20 mM HEPES (pH 7.9), 100 mM KC1, 



0.5 mM DTT, 15% glycerol and used at concentrations 
of 40% for splicing and the E complex formation. 

Analysis of E complex formation in the SMN-depleted 
extract by glycerol gradient centrifugation 

The E complexes were assembled in a reaction containing 
40% nuclear extract, 65 mM KC1, 10 nM MINX. The re- 
actions were diluted two-fold with chilled buffer contain- 
ing 20 mM HEPES (pH 7.9), 65 mM KC1, loaded on the 
10-50% glycerol gradient and centrifuged in TH660 rotor 
at 24 500rpm for 15h. 

Isolation of U2 snRNPs with the antibody to PRPF40A 

The IP was carried out using 1 50 ul of pre-blocked PAS 
charged with 90 ug of affinity purified antibody to 
PRPF40A from the reaction (300 ul) containing 40% 
nuclear extract, 20 mM HEPES (pH 7.9), 65 mM KC1 
and depleted of endogenous ATP by incubating at 30°C 
for 20min. IP was carried out in the IP 150 buffer at 4°C 
for 2.5 h. The bound material was eluted in 400 ul of IP150 
containing 0.6mg/ml of antigenic peptide at 4°C for lh 
and loaded on the 4 ml 10-30% glycerol gradient contain- 
ing 150mM NaCl. Gradients were centrifuged in a Sorvall 
TH660 rotor at 24 800rpm for 15h. RNA from the 
gradient fractions was extracted and analysed by 10% 
PAGE followed by silver staining. Proteins from fractions 
8-11 were precipitated with acetone, re-suspended in 
SDS-loading buffer and separated by 10/13% SDS- 
PAGE for MS analysis. 



RESULTS 

Isolation of the E complex 

Spliceosomes are dynamic molecular machines that 
undergo multiple rearrangements during the splicing 
cycle. Their heterogeneity is a well-documented problem 
for structural studies (29,30). Therefore, to purify the E 
complex, we used antibodies to proteins that associate 
with the spliceosome transiently (16). We selected the Ul 
snRNP as it is a major factor that triggers spliceosome 
assembly and then dissociates from the complex before 
the catalytic spliceosome is formed. We raised an 
antibody to the Ul snRNP-associated protein U1A 
(encoded by the SNRPA gene), for which few interacting 
partners are known and the chances of being accessible to 
an antibody are higher. We also targeted PRPF40A 
(FBP11), a homologue of the essential yeast Prp40p 
protein, which has been proposed to bridge the nitron' s 
splice sites. The peptide antibodies raised against these 
proteins specifically recognized the corresponding poly- 
peptides on western blots of the HeLa nuclear extract 
and efficiently precipitated pre-mRNA from splicing reac- 
tions (data not shown). 

The E complex was assembled using MINX pre-mRNA 
in HeLa nuclear extracts in the absence of ATP (5,31). 
Before immunoprecipitation (IP) of complexes with the 
UlA-specific antibody, the E complex-assembly reaction 
was fractionated by 10-50% glycerol gradient centrifuga- 
tion to separate spliceosomal complexes from free Ul 
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snRNP particles. The concentration of Ul snRNPs in 
HeLa nuclear extracts is ~200nM, at least an order of 
magnitude greater than the concentration of the 
pre-mRNA (32). The position of complexes in the 
gradient (fractions 14-19) was determined by monitoring 
the distribution of radioactively labelled pre-mRNA by 
Cherenkov counting (Figure la). The sedimentation 
value of complexes was determined by comparison with 
the 30S and 50S ribosomal subunits and corresponds to 
35-40 S, which is much larger than the 10S of Ul snRNPs. 
Complexes were isolated from fractions 14-19 using 
UlA-specific antibody immobilized on PAS and eluted 
from the column with the antigenic peptide to minimize 
any unspecific background. To prove the integrity of the 
isolated complexes, the eluted material was analysed again 
by 10-50% glycerol gradients. The material migrated as a 
single peak (Figure lb; fractions 12-17), indicating that 
the isolated complexes are stable and homogeneous. The 
RNA was extracted from the gradient fractions and 
analysed by denaturing PAGE followed by silver 



staining (Figure lc). Fractions 12-17 contained 
pre-mRNA and stoichiometric amounts of Ul and U2 
snRNAs, which are characteristic of the E complex. 
Some Ul snRNPs were detected on the top of the 
gradient (Figure lc; fractions 3-6). It is possible that 
some free Ul snRNPs were present in the E complex frac- 
tions and were precipitated independently of the 
spliceosomal complexes (33). 

An alternative method for purification of the E complex 
began with direct addition of the assembly reaction to 
PAS beads charged with an antibody to PRPF40A. 
Complexes eluted from the beads using the antigenic 
peptide were further purified by glycerol gradient centri- 
fugation under the same conditions as before. 
Significantly, both purifications resulted in the isolation 
of complexes with similar migration abilities (Figure lb). 
RNA analysis of the gradient fractions showed that the 
isolated complexes contained pre-mRNA and Ul and U2 
snRNPs (Figure Id; fractions 12-17). In addition, a sig- 
nificant amount of U2 snRNA was found in fractions 
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Figure 1. Isolation and characterization of the E complex, (a) The E complex was assembled on MINX pre-mRNA in HeLa nuclear extract and the 
reaction was fractionated on a 10-50% glycerol gradient. The gradient fractions were counted by Cherenkov and the position of the labelled 
pre-mRNA is indicated. The major peak, fractions 14-19, was used in IPs with the UlA-specific antibody, (b) Complexes isolated with the 
UlA-specific antibody (grey circles) and with the PRPF40A-specific antibodies (black circles) were analysed by 10-50% glycerol gradients. Both 
purifications yielded complexes with similar 35-40S values. The positions of the E. coli 50S and 30S ribosomes are indicated, (c) RNA was recovered 
from the gradient fractions of the UlA-specific isolation, analysed by the 10% denaturing PAGE and visualized by silver staining, (d) The RNA 
profile from the gradient fractions of complexes isolated with the antibody to PRPF40A. Total RNA extracted from the nuclear extract was used as 
a marker (M). The identity of RNAs is indicated on the left. The asterisks correspond to the 5S and 5.8S ribosomal RNAs. 
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7-10 (Figure Id). This peak was absent from the 
UlA-associated complexes (Figure lc) and is therefore 
unlikely to have arisen from dissociation during centrifu- 
gation. Indeed, we infer that these U2 snRNPs were in 
distinct complexes that were also selected by the 
PRPF40A-specific antibody. The relative migration of 
these U2-containing complexes corresponded to ~20S 
(peak fractions 8-9), which is higher than the previously 
characterized 17S U2 snRNPs (34,35). 

PRPF40A associates with U2 snRNPs 

PRPF40A was not identified as a U2-associated protein 
in previous isolations of the U2 snRNPs (34,35). 
Moreover, its yeast homologue, Prp40p, is a compo- 
nent of the Ul snRNP and not U2 (36,37). To inves- 
tigate whether there is an alternative form of U2 
snRNPs present in HeLa nuclear extract, we carried 
out IPs with the PRPF40A-specific antibody from re- 
actions in the absence of pre-mRNA. The analysis of 
an eluate by glycerol gradient centrifugation showed that 
U2 particles were selected again (Figure 2a; fractions 
9-11), indicating the presence of such complexes in 
HeLa extract. 

Next, we determined the full protein composition of the 
novel PRPF40A-U2 complex. Proteins were recovered 
from gradient fractions 9-11 containing U2 snRNAs, 
fractionated by SDS-PAGE, and stained with 
Coomassie Blue (Figure 2b). The identities of proteins 
were determined after in-gel digestion with trypsin using 
a Q-Trap 4000 mass spectrometer and are listed in Table 1 
and Supplementary Table SI. Consistent with the presence 
of the U2 snRNA, the complex contained all the 
U2-specific proteins identified in previous isolations 
(34,35). In addition, we identified a number of novel 



components in association with the PRPF40A-U2 
complex, including the SWI/SNF chromatin remodelling 
complex proteins such as ARID1A/1B/2, SM ARC A 1/2/4, 
PBRM1, SMARCB1, SMARCC1/2, SMARCD1, 
SMARCE1, ACTL6A and ACTBL2. In addition to 
these, we identified several transcription factors that 
have been identified as spliceosomal components in 
previous purifications of splicing complexes. These 
include RUVBL2/1 (TIP48/TIP49), RBM25 (RED120/ 
S164), SART3 (TIP110), ARS2, BAT1 (UAP56), 
HTATSF1 (Tat-SFl), YEATS2 and CCNK. Other tran- 
scription factors produced insignificant scores, indicating 
that most of these proteins are more likely to be present in 
sub-stoichiometric amounts. The presence of additional 
proteins correlates with the increased S-value of the 
complex. The common trait shared by these proteins is 
their involvement in the regulation of transcriptional 
elongation by RNAP II, which is coupled to histone modi- 
fication and chromatin remodelling. Therefore, our data 
strongly support the hypothesis that there is 
co-transcriptional recruitment of the U2 snRNP to 
nascent transcripts. 

The E complex is a functional stable spliceosomal 
intermediate 

It was important to test whether the E complex that we 
isolated represents a functional spliceosomal intermediate. 
For this purpose, we treated an extract with RNase H in 
the presence of oligonucleotides complementary to Ul 
and U2 snRNAs (28). The Ul and U2 snRNPs were 
digested (Figure 3a), leading to a reduction in splicing 
activity. Then, we tested whether the isolated E complex 
would support splicing of the substrate pre-mRNA in this 
extract. Splicing in the E complex was unaffected by the 
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Figure 2. Characterization of the PRPF40A-U2 snRNP complex, (a) The PRPF40A-specific antibodies precipitate U2 snRNPs from HeLa nuclear 
extract. The IPs were carried out from reactions containing pre-mRNA (left) or without pre-mRNA (right). Eluates from both IPs were fractionated 
by 10-30% glycerol gradients. RNA from the odd fractions was analysed by the 10% denaturing PAGE and visualized by silver staining. 
The identity of the RNAs is indicated on the left, (b) Identification of U2 snRNP-associated proteins. Proteins from the peak fractions of the 
U2 snRNP gradient (fractions 9-11) were recovered, fractionated by SDS-PAGE on the 10/13% polyacrylamide gel and stained with Coomassie 
Blue. The identity of proteins was determined by MS. 
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Table 1. Proteins of the E complex and novel U2 snRNPs 



Gene name 


Mol Mass 


E complex 


Ab to PRPF40A 


E complex 


Ab to U1A 


U2 snRNP Ab to PF 


pept 


emPAI 


pept 


emPAI 


pept 


emP^ 


Sm proteins 
















SNRPB 


24765 


5 


0.33 


6 


0.77 


6 


N/A 


SNRPD1 


13 273 


1 


N/A 


1 


0.3 


2 


N/A 


SNRPD2 


13 632 


9 


4.87 


11 


6.56 






SNRPD3 


14021 


2 


0.64 


5 


0.64 


1 


N/A 


SNRPE 


10 854 


2 


0.87 


2 


0.87 


1 


N/A 


SNRNPF 


9 776 


1 


N/A 






5 


N/A 


SNRPG 


8 547 


3 


N/A 










Ul snRNP 
















SNRP70 


51583 


4 


0.32 


12 


0.52 






SNRPA 


31259 


4 


0.41 


7 


0.77 


1 


N/A 


SNRPC 


17 552 


3 




3 


0.82 






U2 snRNP 
















SNRPA1 


28 512 


7 


1.11 


6 


1.11 


11 


N/A 


SNRNPB2 


25470 


4 


0.75 


3 


0.52 


5 


N/A 


SF3A1 


88 888 


19 


0.92 


38 


2.13 


40 


2.4 


SF3A2 


49 338 


8 


0.44 


10 


0.67 


8 


0.44 


SF3A3 


59154 


22 


0.73 


15 


1.08 


20 


1.65 


SF3B1 


146464 


55 


1.39 


45 


1.06 


58 


1.84 


SF3B2 


97710 


29 


1.18 


15 


0.4 


33 


1.35 


SF3B3 


136 575 


23 


0.62 


25 


0.62 


31 


0.9 


SF3B4 


44414 


2 


0.18 


2 


0.08 


3 


0.27 


SF3B14 


14690 


8 


3.13 


8 


3.13 


6 


N/A 


PHF5A 


13138 


1 


N/A 






2 


N/A 


U2 related 
















PRP40A 


109 022 


9 


0.26 


9 


0.31 


5 


0.14 


PAF60 


60009 


13 


0.94 


29 


2.99 


3 


0.06 


DDX46 


117 803 


17 


0.59 


5 


0.17 






DHX15 


91673 


11 


0.49 


1 




30 


1.21 


SR140 


118 675 


2 


0.03 






13 


0.24 


CHERP 


104078 










14 


0.47 


U4/U6.U5 tri-snRNP 
















PRPF8 


274738 


52 


0.7 






2 


0.03 


SNRNP200 


246006 


36 


0.54 






3 


0.03 


EFTUD2 


110336 


13 


0.48 


8 


0.22 


4 


0.07 


PRPF6 


107 656 


8 


0.27 


1 


0.03 






DDX23 


95 866 


2 


0.04 


2 


0.08 






PRPF3 


77 652 


14 


0.52 










PRPF31 


55 649 


4 


0.21 










PRPF4 


59 097 


4 


0.28 


2 


0.06 






PPIH 


19481 


4 


N/A 










LSM4 


15511 


1 


0.25 










SART1 


90 371 


2 


0.08 










USP39 


65 739 


2 


0.12 










SNRNP40 


39 742 










1 


N/A 


SMN associated 
















SMN1 


32285 


1 


0.12 


1 


0.12 






SIP1 


32021 


2 


0.25 










DDX20 


92981 


3 


0.08 


3 


0.08 






GEMIN4 


121681 


9 


0.27 


23 


0.57 






GEMIN5 


170 793 


2 


0.04 


9 


0.21 






GEMIN6 


18983 


1 


0.2 


3 


0.74 






GEMIN8 


28 904 


1 


0.13 










STRAP 


38 756 






6 


0.32 






Splicing related 
















NCBP1 


92 864 


24 


1.36 


32 


1.65 






NCBP2 


18161 


1 


0.21 


4 


0.21 






U2AF1 


28 368 


4 


0.46 


2 


0.21 






U2AF2 


53 809 


14 


1.39 


7 


0.49 






SF1 








8 


0.45 






SRRM2 


300179 


8 


0.08 


7 


0.06 






SFRS1 


27 842 


10 


1.78 


7 


0.89 


1 


N/A 


SFRS2 


25461 


2 


0.32 










SFRS2B 


32 382 


3 


0.25 


1 


0.12 






SFRS3 


19 546 


2 


0.43 










SFRS4 


56 759 


2 


0.07 










SFRS5 


31359 






5 


0.57 






SFRS6 


39 677 


2 


0.2 


3 


0.31 






SFRS7 


27 578 


4 


0.29 


5 


0.68 






SFRS8 


105190 










1 


0.04 


SFRS11 


53 624 


2 


0.07 










SFRS12 


59402 


2 


0.06 










SFRS15 


126132 




0.03 











(continued) 
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Table 1. Continued 



Gene name Mol Mass E complex Ab to PRPF40A E complex Ab to U1A U2 snRNP Ab to PRPF40A 







pept 


emPAI 


pept 


emPAI 


pept 


emPAI 


HNRNPAO 


30993 


2 


0.26 










HNRPA1L3 


34 373 


2 


0.11 


2 


0.23 


2 


N/A 


HNRNPA2B1 


37464 


1 


0.1 


1 


0.1 


6 


N/A 


HNRNPC 


33 707 


1 


0.11 


2 


0.24 






HNRNPD 


38 581 






1 


0.1 






HNRNPF 


45 985 


2 


0.08 










HNRNPH1 


49484 


1 


0.08 










HNRNPK 


51230 


1 


0.07 










HNRNPL 


64720 






2 


0.12 






HNRNPM 


77 749 






6 


0.2 






SYNCRIP 


69 788 


14 


0.59 


3 


0.11 


6 


0.23 


HNRNPR 


71 184 


6 


0.16 


8 


0.36 


1 


0.05 


HNRNPU 


91 198 


4 


0.13 


3 


0.08 


7 


0.13 


RBM7 


30485 


1 


0.12 


3 


0.42 






RBM10 


103811 










3 


0.07 


RBM17 


45162 


1 


0.08 






8 


0.75 


RBM25 


94407 


34 


1.42 


12 


0.31 


8 


0.21 


RBM39 


59 628 


7 


0.35 


6 


0.27 






RBM42 


50496 


1 


0.07 










RBMXL2 


42929 






4 


0.29 






ARS2 


101060 


18 


0.65 


9 


0.24 


7 


0.11 


LUC7L3 (CROP) 


51834 


6 


0.42 


4 


0.32 






LUC7L2 


46942 


3 


0.08 










PCBP1 


37 987 


2 


0.21 


3 


0.33 






PCBP2 


38 955 






3 


0.32 






SKIV2L2 


118756 


5 


0.13 


3 


0.1 


3 


0.1 


KHSRP 


73 443 






2 


0.1 






FUS 


53 622 






2 


0.14 






DDX5 


69618 


3 


0.17 


6 


0.37 






DDX17 


72953 






3 


0.1 


2 


0.05 


DHX9 


142181 


4 


0.05 










DDX39 


49 611 


1 












PRPF19 


55 603 


7 


0.57 


3 


0.21 






SART3 


110721 


3 


0.1 






6 


0.18 


BCAS2 


26229 


1 


0.15 










ZCCHC8 


79156 


1 


0.05 










DNAJC8 


29 823 










4 


N/A 


THOC4 


26 872 


1 


0.14 










BAT1 


49416 


1 


0.08 






4 


0.16 


YBX1 


35 903 


1 


0.1 










RUVBL2 


51296 


2 


0.15 


2 


0.15 


14 


1.02 


RUVBL1 


50 538 










6 


0.53 


CPSF1 


162036 


1 


0.02 










NUDT21 


26268 


2 


0.14 










CPSF6 


59 344 


1 


0.06 


7 


0.28 






CPSF7 


52189 


1 


0.07 










HTATSF1 


86 371 


13 


0.46 


6 


0.18 


28 


1.52 


Chromatin remodeling 
















ARID 1 A 


242 805 










5 


0.03 


ARID IB 


237 057 










17 


0.22 


ARID2 


198 921 


10 


0.18 






14 


0.25 


SMARCA1 


123211 










1 


0.03 


SMAKLAz 












1 c 

1 J 


U.zz 


SMARCA4 


185100 


27 


0.48 


5 


0.06 


25 


0.45 


PBRM1 


194080 


39 


0.76 






8 


0.08 


SMARCC2 


133196 


25 


0.73 






21 


0.43 


SMARCC1 


123 303 


3 


0.09 






10 


0.27 


SMARCD1 


55195 


2 


0.14 






11 


0.8 


SMARCE1 


46 678 


5 


0.26 






9 


0.59 


ACTL6A 


47 944 


6 


0.46 


3 


0.25 


8 


0.57 


SMARCB1 


44 398 


2 


0.18 






6 


0.38 


ACTBL2 


42318 


2 


0.09 










CHD4 


219 393 


20 


0.26 










CHD8 


263 977 


3 


0.03 






30 


0.39 



The E complex was assembled on MINX pre-mRNA in HeLa nuclear extract and purified using either U1A- or PRPF40A-specific antibodies. In 
addition, the antibody to PRPF40A precipitated novel U2 snRNPs. Proteins from purified complexes were fractionated by SDS-PAGE and sub- 
jected to MS analysis. The presence of a protein in a complex is shown by the number of unique peptides and the emPAI factor (60) — where 
available (note: N/A stands for not available). Calculated molecular masses for proteins and their corresponding gene names are taken from the 
UniProt database. Proteins are grouped according to their known function. Additional proteins identified in purifications which are not common 
components for all complexes are presented in the Supplementary Table SI. 
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Figure 3. The E complex represents a functional spliceosomal intermediate, (a) HeLa extract was depleted of Ul and U2 snRNPs using RNase H 
digestion with complementary oligonucleotides (AU1/U2). Total RNA from the corresponding extracts was analysed by the 10% denaturing PAGE 
and visualized by silver staining. The identity of RNAs is indicated on the left, (b) The time course of splicing in depleted (AU1/U2) and mock (NE) 
extracts. Equal amounts of the E complex isolated using the PRPF40A-specific antibody or naked pre-mRNA were supplemented with the mock 
(NE) or AU1/U2 nuclear extracts and allowed to splice. The aliquots of the reactions were taken at times indicated, RNA extracted, analysed by 
denaturing gel and visualized by exposure to a phosphoimager. The AU1/U2 extract is deficient in splicing naked pre-mRNA but pre-mRNA in the 
E complex is efficiently processed in this extract (compare panels A and B). In the mock extract, pre-mRNA is protected from degradation in the E 
complex as compared to the naked pre-mRNA (panels C and D). The identities of the precursor, splicing intermediates and the product are indicated 
on the right. 



depletion of Ul and U2 snRNPs (Figure 3b; panels A and 
B). This supports the view that the E complex we isolate is 
splicing competent and does not require the association of 
either Ul or U2 snRNPs, which indicates the integrity of 
the complexes. Notably, the E complex generated even 
more spliced product than naked pre-mRNA added to 
intact mock-treated extract (Figure 3b, panels C and D). 
This shows that pre-mRNA was protected from degrad- 
ation in the E complex as compared to the naked 
pre-mRNA. On the other hand, the E complex had no 
kinetic advantage in the course of reaction, suggesting 
that later steps are limiting. 

The protein composition of the E complex 

The proteins in the E complex must be responsible for the 
initial steps in spliceosome formation and the pairing of 
splice sites. Having proved that the isolated E complexes 
represent a functional intermediate step in the spliceosome 
assembly pathway, we analysed their composition. Equal 
amounts of complexes purified by the PRPF40A- or 
UlA-specific antibodies were fractionated by SDS- 
PAGE and stained with Coomassie Blue (Figure 4a). 
Proteins identified by MS analysis are listed in Table 1 
and Supplementary Table SI. 

The comparison of protein compositions revealed that 
almost all proteins identified in the complex purified with 
the UlA-specific antibody were present in the complex 
isolated with the antibody to PRPF40A. Overall, more 
proteins were identified in the preparation with the 
PRPF40A-specific antibody and some common proteins 



were detected by a larger number of peptides. These ob- 
servations are consistent with the fact that the purification 
procedure used to isolate complexes with the antibody to 
U1A was more rigorous and included fractionation of the 
complexes by glycerol gradient prior to affinity purifica- 
tion. This could result in the loss of more weakly 
associated proteins. For example, the U4/U6.U5 
tri-snRNP proteins were better represented in the 
PRPF40A preparation, indicating the intrinsic affinity of 
the tri-snRNP particle for the pre-spliceosome, but at this 
stage of assembly this interaction is unlikely to be stable 
enough to withstand the conditions used for U1A isola- 
tion. Therefore, we believe that most of the common 
proteins in both isolations represent the more tightly 
bound components of the E complex. 

Consistent with its RNA composition, the E complex 
contained Ul and U2 snRNP-associated proteins. Most of 
them were identified by a similar number of peptides in 
both preparations, confirming that equal amounts of 
complexes were subjected to the analysis. The E complex 
also contained proteins that associate with pre-mRNA 
directly. These include the cap-binding complex proteins 
and hnRNP proteins Al, A2/B1, C, Q, R and U together 
with poly(rC)-binding protein (PCBP1), SR proteins 
(SFRS1, 2B, 6, 7), RNA-binding motif proteins 
(RBM25, 39, 7), PUF60 and SR-related protein SRRM2 
(SRp300) and CROP (Luc7A). The DEAD box family 
ATP-dependent RNA helicases DDX5 (p68) and 
DDX46 (hPRP5) were identified in both preparations. 
In addition, proteins that affect alternative splicing and 
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Gradient Fractions 

GEMIN6/7 



Figure 4. Protein composition of the E complex, (a) Proteins from both preparations of the E complex isolated using either U1A- or 
PRPF40A-specific antibodies were fractionated by 8%/14% SDS-PAGE and stained with Coomassie Blue. The identities of proteins were 
determined by MS. (b) The U1A preparation of the E complex was analysed by 10-30% glycerol gradient and the gradient fractions were 
counted by Cherenkov to follow pre-mRNA distribution, (c) RNA and protein composition of U1A gradient fractions. RNA was extracted from 
the gradient fractions, separated by 10% denaturing PAGE and visualized by silver staining. Proteins from the gradient fractions were separated by 
10% SDS-PAGE and western blotted with antibodies to SMN1, SIP1 and GEMIN6. (d) Complexes isolated with the antibody to PRPF40A were 
fractionated by 10-30% glycerol gradient and pre-mRNA was detected by Cherenkov counting, (e) RNA and protein analysis of gradient fractions 
from the PRPF40A-E complex isolation. The identities of RNA and antibodies are indicated on the left. 
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therefore might influence spliceosome assembly were 
found in association with the E complexes. All these 
proteins were also identified as components of E 
complexes assembled on c-src pre-mRNA (20,21). Their 
presence on both MINX and the neuron- specific substrate 
suggests that they have constitutive functions in splicing. 
One difference between the complexes purified by the 
antibodies to PRPF40A and U1A is that the former 
lacked the proteins SF1 and DDX17 (p72). This did not 
prevent the complex functioning in a commitment assay 
(Figure 3b). 

Significantly, the purified E complexes contain proteins 
of the SMN-associated complex. Although an indirect role 
for SMN in splicing has been suggested previously 
(39-43), this is the first time that the SMN complex has 
been identified as a possible spliceosomal component. The 
complex comprises nine proteins: SMN, GEMINS2-8 and 
STRAP (UNR-IP) (44-49). We identified all of them 
except GEMIN7, although GEMIN2 (SIP1) and 
GEMIN8 were detected only in the PRPF40A-speciflc 
preparation. Thus, it is likely that all proteins of the 
SMN complex co-purify with the E complex. 
Significantly, neither antibody, to U1A or PRPF40A, 
precipitated Ul or U2 snRNPs in association with the 
SMN complex. This is consistent with the idea that the 
presence of the SMN complex in the spliceosomal complex 
E is independent of snRNPs. To verify the MS data, the E 
complex preparations were analysed by western blotting. 
Complexes were purified as previously described and 
analysed by 10-30% glycerol gradients (Figure 4b and 
d). Antibodies against proteins SMN (SMN1), GEMIN2 
(SIP1) and GEMIN6-7 were used for detecting proteins of 
the SMN complex across the entire gradients. The 
designated proteins were detected in fractions correspond- 
ing to the E complex with material purified via both U1A 
and PRPF40A antibodies (Figure 4c and e). This provides 
a strong evidence for the SMN complex proteins as com- 
ponents of the E complex. 

Interestingly, analysis of the complexes purified with the 
antibody to PRPF40A showed two peaks for SMN (frac- 
tions 11, 12 and 14-16). The major peak (fractions 14-16) 
corresponds to the E complex, whereas fractions 1 1 and 12 
do not coincide with any RNA species. It suggests that the 
antibody to PRPF40A also precipitated an SMN- 
containing complex in addition to the E complex and 
U2 snRNP. We infer that PRPF40A-SMN complexes 
might exist in HeLa nuclear extracts. 

In addition, we tested some of the proteins identified in 
the PRPF40A-U2 snRNP complex (Figure 4e). The signal 
produced by an antibody against the U2 snRNP-speciflc 
protein SF3A2 corresponds to the distribution of U2 
snRNA across the gradient. We also used an antibody 
to one of the SWI/SNF complex proteins, ACTL6A, 
and detected it in the PRPF40A-U2 snRNP complex. 

Depletion of the SMN protein 

To test whether SMN plays a role in the formation of the 
E complex, it was depleted from HeLa nuclear extract 
using an antibody to SMN. Depletion was done at two 
different salt concentrations, 200 and 700 mM NaCl. 



Depletion at the higher salt concentration resulted in 
more efficient removal of the proteins associated with 
SMN and their presence was detected in the eluate from 
the beads. Importantly, neither U1A nor SF3A proteins 
were affected, indicating that the SMN complex was 
removed without the Ul or U2 snRNPs (Figure 5a). 
Depletion inactivated the extract (Figure 5b). 

The ability of the SMN-depleted extract to form 
complexes was analysed by glycerol gradient centrifuga- 
tion. The reactions were loaded on the gradient and the 
distribution of pre-mRNA across the gradients was moni- 
tored by Cherenkov counting. The comparison of 
pre-mRNA distributions displayed a striking difference 
between the two extracts (Figure 5c). In the mock-depleted 
extract, the major peak had the expected 35-40S value and 
formation of the E complex was unaffected. In contrast, 
complexes formed in the SMN-depleted extract produced 
a much smaller peak corresponding to the E complex 
(fractions 11-13) and the majority of pre-mRNA was 
found in slower-migrating fractions (7-9) (Figure 5c). To 
analyse complex formation in more detail, RNA was ex- 
tracted from gradient fractions, separated by denaturing 
PAGE and visualized by silver staining (Figure 5d and e). 
In addition to the reduced amount of E complex, the dis- 
tribution of U2 snRNA was significantly different in the 
SMN-depleted extract. The majority of the U2 snRNA 
was found in fractions 4-6, not associated with the 
pre-mRNA. This suggests that association of U2 snRNP 
in the E complex might require SMN and its associated 
proteins. 



DISCUSSION 

The E complex represents the first definite complex in the 
spliceosome assembly process. Previous studies identified 
some of its components but a full characterization of 
complex E on a standard substrate had not been done. 
Here, our objective was to isolate the E complex and char- 
acterize its components in detail. We carried out two in- 
dependent immunopuriflcations with antibodies to U1A 
and PRPF40A and identified their protein composition 
using MS. A comparison of the isolates shows that most 
proteins are common to both purifications and therefore 
are likely to represent true components of the E complex. 
In addition to the known factors, we have identified 
several novel components. One of these is the SMN 
complex, which we show might be required for 
assembly. Analysis of the complexes formed in the 
SMN-depleted extracts showed that U2 snRNP is no 
longer associated with pre-mRNA. Since U2 snRNP is 
essential for the E complex formation, our data suggest 
that the SMN complex plays a role in U2 snRNP inter- 
action within the complex. To confirm this it would be 
necessary to reconstitute the depleted extract with the 
SMN complex. However, reconstitution is not yet 
possible because recombinant proteins of the SMN 
complex are insoluble. For this reason, we cannot estab- 
lish whether the entire SMN complex is necessary for 
spliceosome assembly. 
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Figure 5. Depletions of the SMN and SMN-associated proteins affect E complex formation, (a) Western blot analysis of the mock and 
SMN-depleted extracts with different antibodies. HeLa nuclear extracts were depleted with the antibody to SMN at 200 and 700 mM NaCl and 
analysed by western blot together with the material removed. Mock — mock-treated extract, ASMN — depleted of SMN at 200 and 700 mM salt, 
correspondingly, P — precipitate. The identities of proteins detected by antibodies are shown on the left, (b) Depletion of SMN from the nuclear 
extract resulted in splicing inhibition as compared to the mock-treated extract. The pre-mRNA was incubated under splicing conditions in mock and 
SMN-depleted extracts. The aliquots of the reactions were taken at times indicated, RNA extracted, analysed by denaturing gel and visualized by 

(continued) 
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At present the interactions between U2 snRNPs and 
other factors in the E complex are poorly characterized. 
Much is known about its interactions in the A complex 
where, upon conformational changes induced by ATP hy- 
drolysis, the U2 snRNA becomes base paired to the 
branchpoint sequence and the U2-associated proteins of 
the SF3A/B complex form an anchoring site on the 
pre-mRNA. These rearrangements result in stable associ- 
ation of U2 snRNPs in the A complex. The SMN complex 
is not a component of the A complex and presumably 
dissociated during the transition. It will be of interest to 
elucidate the precise interactions of SMN in the E complex 
in greater detail. 

Deficiency in the SMN protein causes spinal muscular 
atrophy (SMA). SMN and its associated proteins play a 
major role in snRNP biogenesis, and a recent study of an 
SMA mouse model detected widespread alterations in the 
stoichiometry of snRNAs in different tissues (43). The 
splicing of specific pre-mRNAs was inhibited in the 
SMA mice, but it was not established whether this arose 
from the changes in snRNP stoichiometry (43). Similar 
findings were reported for S. pombe when the expression 
of SMN was downregulated (39). The possible role of 
SMN in spliceosome formation may explain such defects 
in pre-mRNA processing observed in SMN-deficient cells. 

Along with the E complex, a novel U2 snRNP- 
containing complex was selected by the antibody to 
PRPF40A from HeLa extract. These particles have a sedi- 
mentation coefficient of ~20S. The earliest purification 
procedure for human U2 snRNPs involved several chro- 
matographic steps and resulted in isolation of 12S and 17S 
U2 snRNPs, which contain only the most stably bound 
components. A more recent procedure used an antibody 
to SF3A2, raised against a peptide (amino acid 444-458). 
In this case a number of additional proteins were identified 
as components of U2, including DDX46 (PRP5), but not 
PRPF40A (35). The antibody to SF3A2 was raised against 
a peptide (amino acid 444-458) located in a proline-rich 
region of the protein that forms the interface with 
PRPF40A (56). Thus, it is possible that the use of this 
epitope in SF3A2 selects against co-purification with 
PRPF40A. 

The major difference between the new and previously 
described U2 snRNPs is the association of the SWI/SNF 
complex proteins with the particles. The SWI/SNF 
complex interacts with chromatin (50), suggesting a 
model in which it might serve as an adaptor for U2 
snRNP association with chromatin. The same SWI/SNF 
proteins were identified as components of the E complex 
in this study. Their recruitment into the spliceosome is 
likely to occur in a complex with the U2 snRNP. 
Interestingly, the BRM protein, a component of the 



SWI/SNF complex, was previously shown to associate 
with nascent transcripts and affect alternative splicing 
(51). This suggests that the SWI/SNF complex might be 
involved in differential recruitment of the U2 snRNP to 
nascent transcripts. 

One of the most conspicuous features of the E complex 
is the proximity between the 5'- and 3'-SS and the branch 
sequence (3,52). Therefore, the identification of factors 
responsible for attaining this conformation is important 
for understanding E complex formation. PRPF40A has 
been proposed to mediate interactions between the 5 / -SS 
and branch sequence (53). In S. cerevisiae, Prp40p is a 
component of the Ul snRNP (36,37,54) and interacts 
with branchpoint binding protein Msl5 (53). The human 
homologue of Prp40p, PRPF40A, has also been shown to 
interact with SF1, a homologue of Msl5 (53,55,56,57). 
Here, we identified PRPF40A as a component of the U2 
snRNPs, but its interacting partners at the 5'-SS are cur- 
rently unknown. It is possible that the use of peptide 
antibodies prevented the recovery of U1-PRPF40A 
complexes. Another proposed candidate for the bridging 
of SSs is a DEAD box family ATP-dependent RNA 
helicase, DDX46 (PRP5), which has been proposed to 
bridge the Ul and U2 snRNPs in the A complex (58). 
We identified DDX46 in the E complex, where it might 
also be involved in bridging interactions. Alternatively, 
SR proteins have been proposed to link the Ul snRNP 
bound at the 5'-SS with U2AF bound at the 3'-SS (59). 
Thus, it is plausible that many interactions between com- 
ponents of the E complex contribute to the link between 
the ends of the intron. The least understood are the inter- 
actions of the U2 snRNP in the E complex. Base pairing 
between the U2 snRNA and pre-mRNA has not been 
detected in the E complex and moreover, its association 
does not require a branchpoint sequence (5). Therefore, 
elucidating the interactions of SMN-associated proteins in 
the E complex might reveal the means by which U2 
snRNPs associated with the complex. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Table SI and Supplementary 
Reference [60]. 
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Figure 5. Continued 

exposure to a phosphoimager. Identities of the RNA are indicated on the right, (c) The E complex is not formed in the SMN-depleted extract as 
opposed to the mock-treated extract (fractions 11-13). Complexes assembled on pre-mRNA in mock and SMN-depleted extracts were fractionated 
by 10-50% glycerol gradient centrifugation. Gradient fractions were collected manually from the top to the bottom and counted by Cherenkov. 
(d) The RNA analysis of the E complex formed in mock-depleted extracts. RNA extracted from the gradient fractions was separated by the 10% 
denaturing PAGE and visualized by silver staining. Fractions 10-13 correspond to the E complex, (e) The RNA analysis of complexes assembled in 
the SMN-depleted extracts. The peak corresponding to the E complex is missing and pre-mRNA is found in non-productive complexes with amounts 
of the U2 snRNA greatly reduced. The identity of RNAs is indicated on the left. The asterisks correspond to the 5S and 5.8S ribosomal RNAs. 
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